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Purely dissipative evolution equations are often cast as gradient flow structures, z = K{z)DS{z), 
where the variable z of interest evolves towards the maximum of a functional S according to a metric 
defined by an operator K. While the functional often follows immediately from physical consider¬ 
ations (e.g., the thermodynamic entropy), the operator K and the associated geometry does not 
necessarily so (e.g., Wasserstein geometry for diffusion). In this paper, we present a variational state¬ 
ment in the sense of maximum entropy production that directly delivers a relationship between the 
operator K and the constraints of the system. In particular, the Wasserstein metric naturally arises 
here from the conservation of mass or energy, and depends on the Onsager resistivity tensor, which, 
itself, may be understood as another metric, as in the Steepest Entropy Ascent formalism. This 
new variational principle is exemplified here for the simultaneous evolution of conserved and non- 
conserved quantities in open systems. It thus extends the classical Onsager flux-force relationships 
and the associated variational statement to variables that do not have a flux associated to them. We 
further show that the metric structure K is intimately linked to the celebrated Freidlin-Wentzell the¬ 
ory of stochastically perturbed gradient flows, and that the proposed variational principle encloses 
an infinite-dimensional fluctuation-dissipation statement. 

PACS numbers: 46.05.+b, 05.70.Ln, 05.40.-a 


Dissipative evolution equations (e.g., heat conduction, 
mass diffusion, interface motion) often follow variational 
principles, such as Onsager’s least dissipation of en¬ 
ergy mm and extensions, in particular those based on 
maximum entropy production (MEPPs [3l|4j) or Steep¬ 
est Entropy Ascent (SEA) [MZ])- Mathematically, these 
equations are often of gradient flow type, that is, they 
can be described by the steepest ascent/descent of a func¬ 
tional, such as the entropy. Here descent has to be mea¬ 
sured in a metric, which is neither provided by the afore¬ 
mentioned variational approaches, nor it is always intu¬ 
itive (e.g., Wasserstein metric for diffusion processes). In 
this article, we establish a variational framework based 
on the ansatz of maximal entropy production which sheds 
light on the geometry of purely dissipative evolution 
equations. This new approach (1) delivers a construc¬ 
tion of the gradient flow metric from conservation con¬ 
straints in the variational formulation; (2) extends On¬ 
sager’s principle to simultaneously account for conserved 
and non-conserved quantities in open systems; and (3) 
encloses an infinite-dimensional fluctuation-dissipation 
statement, as shown from a large deviation argument for 
stochastically perturbed gradient flows. The diagram of 
Fig. [^summarizes the connections established in this pa¬ 
per. 

I. BACKGROUND 

We sketch some of the most closely related varia¬ 
tional principles and provide a short summary on gra- 
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dient flows. The body of literature, both classic and 
recent, on these two topics is too large to be reviewed 
comprehensively here. 

A. Entropy production 

Onsager, in his celebrated papers mm generalized the 
transport laws, such as those by Fourier, Ohm or Pick, to 
account for a possible coupling between different physical 
processes. He proposed a general linear kinematic con¬ 
stitutive relation between fluxes J and forces X, that is, 
Ji = The conductivity matrix L may depend 

on the state variables (temperature, pressure, chemical 
potential, etc.), but not on their gradient |8], and is sym¬ 
metric as a result of the time reversal of the underlying 
atomistic equations of motion, = Lji. These two 
properties of the constitutive relations — linearity and 
symmetry of the conductivity tensor — can be equiva¬ 
lently expressed by means of the principle of least dissipa¬ 
tion of energy [T] (following Rayleigh’s nomenclature 0). 
Namely, let as = be the entropy production and 

$(J) = denote a local dissipation poten¬ 

tial, with the resistivity tensor R = being positive 
definite, then the variational principle reads 

max[a,(J,X)-$(J)]. (1) 

In Onsager’s words [1], ‘the rate of increase of the en¬ 
tropy plays the role of a potential’. Several general¬ 
izations of this extremum principle have since emerged 
in different fields encompassing climate [10], soft matter 
physics m , plasticity [T2| , biology [13] and quantum me¬ 
chanics m among others, and appear under the names of 
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FIG. 1. Summary of the connections established in this work between the maximum entropy production principle, gradient 
flow structure, large deviation principle and fluctuation dissipation relation K oc crcr*, with a defined by the stochastic gradient 
flow. 


Maximum Entropy Production Principles (MEPPs) [5} [1] 
and Steepest Entropy Ascent (SEA) [5H7]. This latter 
framework provides a geometric interpretation of the re¬ 
sistivity tensor R and generalizes $ to arbitrary (but 
a priori unknown) metric spaces. Another approach 
to nonequilibrium thermodynamics, which combines re¬ 
versible and irreversible dynamics, is the General Equa¬ 
tion for NonEquilibrium Reversible-Irreversible Coupling 
(GENERIC) [121HB]. The structure of this formalism can 
be derived using contact forms in the setting of the Gibbs- 
Legendre manifold [13 HU; it can be cast variationally; 
and it allows for a systematic multiscale approach [19j as 
well as a treatment of fluctuations [nmol- 


B. Gradient flow structures 

From a mathematical perspective, purely dissipative 
evolution equations can often be described as gradient 
flow structures |21] . This means that the vectorial vari¬ 
able z of interest (components are, for example, energy, 
density or interface position) evolves according to the 
steepest ascent of a functional S (or descent for —S) in 
a geometry given by a metric associated with a positive 
semi-definite operator K, 

z = K{z)DS{z), or G{z)z = DS{z), (2) 

where G = K~^ if the inverse is defined, and DS := ^ 
is a force. Note that is precisely the irreversible com¬ 
ponent of GENERIG. Then S' is a Lyapunov functional, 
S = (DS, z) = (Gz, z) = {DS,KDS) > 0, where (,) 
denotes the dual parity between elements of the tangent 
and the cotangent space. 

Two common examples of ([^ are the flow and 


Wasserstein flow [55], written for scalar-valued z as 

z = m{z)DS{z), (3) 

i =-V • (M(z)VDS(z)), (4) 

respectively, with m > 0 and M positive semi-definite. 
The latter equation is symbolically expressed as i = 
Wj^^DS(z), 01 W-^z = DS{z), with K := := 

—V • MV. Further details on the weak formulations of 
both flows and the norms involved are given in the Ap¬ 
pendix. 

It is noteworthy that the same equation can have dif¬ 
ferent gradient flow representations. For example, the 
diffusion equation 

p{t, x) = V • (m(p(t, x))Vp(t, x)) (5) 

can be interpreted both as flow (with mobility 

mi := m and Dirichlet integral Si(p) := —| / iVppdx) 
and Wasserstein flow (with mobility M 2 := mpl and 
Boltzmann entropy «S '2 = — f^plogpdx). The Wasser¬ 
stein formulation is a natural choice since it involves the 
physical entropy. This flow and its associated metric will 
be automatically singled out by the variational principle 
proposed here, as we show next. 

II. ENTROPY PRODUCTION AND 
DETERMINISTIC EVOLUTION 

In this section, we present a new variational principle 
for purely dissipative evolution equations based on the 
ansatz that systems evolve in the direction of maximum 
entropy production (see Eq. ([^ below) [53] , so as to reach 
the equilibrium configuration as fast as possible. The 
philosophy is therefore similar to SEA and MEPPs, yet 
different in its detailed formulation. In particular, the 
proposed principle will provide a direct relation between 











3 



FIG. 2. Evolution of a closed system at z{t) (a) without and (b) with the presence of a constraint. The equilibrium configuration 
coincides with the maximum of the entropy functional S{z). 


the operator K and physical constraints in the system, 
thus shedding some light on the geometry of dissipative 
equations. 

For simplicity, we first consider closed systems defined 
by a scalar variable and later generalize the obtained re¬ 
sults to open systems and the vectorial setting. Illus¬ 
trative examples are then chosen to demonstrate the ap¬ 
plicability of the principle for both conserved and non- 
conserved fields, with explicit consideration of the bound¬ 
ary conditions. We note that non-conserved quantities 
do not have a flux associated to them, and therefore lie 
outside of the direct scope of Onsager’s principle Q. 

For a closed system out of equilibrium characterized by 
a scalar state variable z, the maximum entropy produc¬ 
tion ansatz is mathematically equivalent to the search of 
the velocity i maximizing S = J sdx = {DS, z ), where s 
is the entropy density and S the total entropy of the sys¬ 
tem. The maximization is pointwise in the tangent space 
for fixed z, c.f. Fig. However, this problem is not well- 
posed unless the length of the vector z is prescribed, in 
which case the problem is reduced to the search of the 
optimal direction. This constraint is easily incorporated 
with a Lagrange multiplier, yielding a variational princi¬ 
ple with Lagrangian 

V[z\ = {DS{z),z) - {z,r]{z)z), (6) 

where the precise value of the length, which may de¬ 
pend on z, has been obviated since it does not partic¬ 
ipate in variations for fixed z. The evolution is then 
obtained by variations of (§ with respect to i, giving 
z = m{z)DS{z), with m{z) = {2ri{z))~^ > 0, since en¬ 
tropy would decrease otherwise. This shows that the 
gradient flow ([^ with functional S naturally results from 
the maximum entropy production principle in the ab¬ 
sence of any physical constraint. 

However, the evolution of z is often subjected to con¬ 


servation constraints of the form 

which naturally occurs when z represents mass or energy. 
In this situation, the maximal dissipation occurs within 
the manifold of conserved z, 

i: -|- V • J = 0, 

where J = 0 on the boundary dfl for a closed system. 
With an additional Lagrange multiplier A, the variational 
principle at each point z can then be written as 

V[z,3,X] = (iA5,i)-(A(z),i + V-J)-(J,H(z)J), (7) 

where the length constraint (measured with metric ten¬ 
sor H) has now been placed on the unknown variable J. 
We note that constraining the length of i as in (§ would 
leave J partially undetermined, and so would be the con¬ 
stitutive relations, such as Fourier’s law for the case of 
heat conduction. 

Variation with respect to J in Q delivers 

0 = -(A, V • (5J) - (2HJ,(5J) for all (5J, 

which, after integration by parts, yields VA = 2HJ. 
Variations with respect to z and A give 

DS — A = 0 and z = —V • J. 

Altogether, this leads to a Wasserstein gradient flow with 
functional S and weight M = (2H)“^ positive semi- 
definite, i = —V • (MViAS'). The Wasserstein gradient 
flow can be thus be understood as an gradient flow 
restricted to the manifold of conserved quantities. 

In general, systems are characterized by a set of state 
variables z, some of which are conserved, z^, (e.g., energy. 
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concentration), and some of which are not, z„, (e.g., in¬ 
terface position), i.e., z = [zu,Zc]^. In this case the 
variational principle can be written as 

V[i, J, A] = {DS, z) - (A(z), z, + V • J) - (J, H,(z)J) 

H^(z)z^), (8) 

where now A is a vectorial Lagrange multiplier, and 
and He are second-order tensors. Similar derivations as 
above yield the evolution equations 

N-i 5S 

z„ = (2H„) —= Kuj —, 

dZy^ OZy 

=-V . ((2H,)-vg) = if.g. 

which have an analogous structure to those previously 
obtained. However, for anisotropic materials, coupling 
between variables of different tensorial quantities is pos¬ 
sible, and in this case, the Lagrangian shall be written 
as 


V[i, J, A] = {DS, z) - (A(z), + V • J) 

- ([z„, J]'^,H(z)[z„, J]"^). (9) 


Variations of this functional with respect to z„, Zc, J and 
A give 


2H 



/ 2H„ 2H„c\ f 
1,2HL 2hJ1,j 



with z'c = —V • J. Then, the evolution equations read 


production, in accordance with the second law of ther¬ 
modynamics. Assuming that the system is completely 
characterized by the internal energy, and taking also the 
conservation of energy into account, the Lagrangian reads 


X>[e,A,q]= f ^edx+ f dx 

Jn 9e Jga T 

— I X{e + \/-q}dx— f q^Hqdx, (10) 
Jn Jn 


where n is the outer normal to the domain, q is the 
heat flux, and s and e represent the entropy and en¬ 
ergy per unit volume, respectively. From basic thermo¬ 
dynamic relations, assuming local thermodynamic equi¬ 
librium, s = T^e. Therefore, variations with respect to e, 
A and q, assuming boundary conditions in T (boundary 
conditions in q would imply 5q = 0 on dU, and lead to 
the same evolution equation) yield 


-A = 0, 

T 

e -I- V • q = 0, 



AV • ((5q) 


2Hq (5q 


dx = 0 


V(5q, 


which combined give the equation of heat transfer, with 
K := (2Hr2)-i, 


^r = e = -V-(^(2H)-iv(^^)) =V-(KVr). (11) 


We remark that the analogous derivation via Onsager’s 
principle of least dissipation, i.e.. 



f M, 

1^-v • (M^,n) 


M„cVn A 

-V- (M,vn)j 


SS \ 

Al = KDS, 

5zc / 


V[X\ = q • X - ixLX, 


with X = —V 



( 12 ) 


where the symbol □ indicates how the operator is applied 
to the vector DS. Further, = 2H, i.e., 

- i [H„ - 

This simple viewpoint of dissipative evolution equa¬ 
tions via constrained maximization will be exemplified 
below for the equation of heat transfer and interface mo¬ 
tion in open system, as blueprint for the derivation of 
other equations in a similar manner. 

a. Example: the heat equation and Fourier’s law. 
We now show that Fourier’s law and the heat equation 
follow directly from the postulate of maximum entropy 
production. For an open system, the Lagrangian of the 
maximum entropy production principle is constructed by 
subtracting the entropy flow entering the boundary of the 
domain from the total entropy rate. Then the entropy in¬ 
crease considered exclusively originates from the internal 


leads to Fourier’s law q = LX, which, complemented 
with the first law of thermodynamics, yields 0 with 
L = (2H)“^. However, the physical motivation of the 
Lagrangian in Eq. (101 seems more natural than that of 

0. 

b. Example: Interface motion in an isotropic 
medium. Next, we consider a two-phase system sepa¬ 
rated by an interface, which we characterize by an addi¬ 
tional variable 4> in the spirit of a phase field model [24] . 
Following a similar strategy as in the previous case, the 
evolution of the interface coupled to the heat equation 
can be obtained as the extremum of 


A,q] = / sdx+ 

Jn Jon ^ 


dx 


— / A (e-1-V • q) dx — / q-p,qdx— / fpcfdx. 

Assuming the existence of a thermodynamic relation for 
the energy density e of the form e = e(s, V^), 


de = Tds + 


de 

d(j) 




s,V(f) 


s,(p 
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where subscripts indicate the variables that are held 
fixed. Its Legendre transform with respect to the entropy 
density s is the Helmholtz free energy /, 


df = -sdT+[^] d(l) + 


d(j} J 


s,V4> 




dS/(j). 


s,(p 


One then obtains 

1 1 

s = —e — — 
T T 


1 


1 


T® T 


(h 

d(j) 

dcj) 


s,'V4) 


T,Vcl> 






As in the previous example, we obtain the heat equation 
from variations of V with respect to e, A and q, while 
variations with respect to (j) yield the evolution of the 
interface, 


e = V • (yfcVT), with k := 

SJf/Tdx djf/T) djf/T) 

5(j) dcj) ■ 

Thus the interface is driven by the Massieu potential 
—f/T, whose relevance has been noted in SEA [5SH1Z], 
in the GENERIC setting [55] as well as in large deviation 
theory [29| . We note that the derivation of this evolution 
in the Onsager formalism is nontrivial as (j) does not have 
a flux and a corresponding thermodynamic force. 


III. STOCHASTIC EVOLUTION AND LARGE 
DEVIATIONS 

In this section, we show that the proposed variational 
formulation for purely dissipative equations based on 
physical considerations is further supported by a large de¬ 
viation principle (LDP) associated to stochastically per¬ 
turbed gradient flows. The LDP provides the probability 
of a given evolution to occur, and therefore intrinsically 
contains a variational principle for the most likely path. 
Large deviation arguments have recently been used to 
connect particle models to gradient flows, for example 
in [30ll32j . and have also led to variational formulations 
of systems in GENERIC form [55] . 

Specifically, let z — z{t, x) be a vector field that evolves 
in f G [0, T] according to a stochastic gradient flow with 
small noise, 

dz = K{z)DS{z)dt + a{z)\/e d'Bt^x, (13) 

where ^t,x is a vector of independent Brownian sheets, 
i.e., s)S{x-y), with <5^ the Kro- 

necker delta function and 5(x) the Dirac delta function. 
Eurther, a{z) is an operator acting on d'Bt,x, and e is 
a small parameter controlling the strength of the noise. 
The stochastic calculus is to be understood in the Ito 
sense. 


The probability distribution for z(f,x) satisfying (13) 


may be obtained from that of simpler processes using 
the theory of large deviations and the contraction prin¬ 
ciple |33|. Indeed, by Schilder’s theorem, the probability 
distribution of the solutions to the vectorial ordinary dif¬ 
ferential equation du — ^/ed^t, with B( a vector of time 
white noises, E[Bi{t)Bj{s)] = dij6{t — s), follows 

P[u(<) « <p(i)] oc where /[y>] = \ [ dt 

2 Jo 

(14) 

is called the rate functional. In words, the probability 
for u(t) undergoes an exponential decay with rate 1/e, 
and narrows as e —>■ 0 around the deterministic solution 
(f = 0. Then, the probability distribution for v(t,x) 
satisfying dv = ^/ed'Bt,x can be obtained by expanding 
v(t,x) and ^t,x with orthonormal basis functions efe(x) 
for the domain [551 |MI , 


v = ^A,"efc, B,., 


(15) 


where B/ are independent Brownian motions (direct 
computations show that E[Bi.^t^xBj-^s^y\ = SijS{t — s)S{x — 
y)). The partial differential equation dv = ^/ed'Bt,x is 
then equivalent to the system of vectorial ordinary dif¬ 
ferential equations dA^ = -^/edB/; and the rate func¬ 
tional of the associated large deviation principles, for 
if = (see, e.g., |3ll|36]), can be readily ob¬ 

tained from ( [T4| ) 

(16) 

0 ^ 0 

The solutions to © can be seen as z{t, x) = M'Rt,x = 
M(v/^/e), where M is an operator. If M is continuous 
(see m for measurable functions), then, by the contrac¬ 
tion principle, z follows a large deviation principle |38j 
with functional I[ip] = | dt, i.e.. 


P[z(f,x) « (p{t,x)] oc 

\\ip- K{ip) , (17) 


assuming (crcr*) ^ defines a norm, with a* being the ad¬ 
joint operator of a. This result follows the spirit of On¬ 
sager and Machlup [39], for general gradient flow struc¬ 
tures; however, the probability distribution obtained is 
not a function of the thermodynamic forces and fluxes as 
in the original formulation by Onsager, but of the vari¬ 
able z and z. This difference is analogous to that of (10) 
and (|T^. 


IV. MAXIMUM ENTROPY PRODUCTION 
FROM LARGE DEVIATIONS 


Equation (17) shows that the most likely path is the 


one that maximizes the exponent and thus minimizes 
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||z - K{^) dt. This minimum is at¬ 

tained by pointwise optimization (over z for fixed z at 
every instant of time), giving 

min||z-ii:(z) £>5'(z)||^^^.)_i . (18) 

Z ' 


Equation (18) represents a variational principle for the 
deterministic gradient flow, which, for K oc crcr*, is shown 
below to be equivalent to Eq. § for 17 gradient flows, to 
Eq.Q for the Wasserstein evolution, and to Eqs. (|^ and 
© for the combined vectorial case. Indeed, expanding 
the squares in Eq. (18) yields the variational problem 


max 

Z 


(Z?5'(z),z) — $(z) — 'I'(z) 


(19) 


which represents a fluctuation-dissipation relation in in¬ 
finite dimensions. 

Square root of the Wasserstein operator. 

We now discuss the expression a = \fK encoun¬ 
tered in the fluctuation-dissipation statement above for 
the Wasserstein operator. In general, for a given posi¬ 
tive semi-definite self-adjoint K there are several choices 
a\ij\ = a 20'2 = K. However, only a a* appears in the 
generator and thus the solutions to the corresponding 
Eokker-Planck equations for different roots are statisti¬ 
cally equivalent [10]. For Wasserstein gradient flows we 
only consider a = ^/K of divergence form, to have a 
conservative noise, i.e., crdB = V • j, where e = 1 for sim¬ 
plicity. Then, for the Wasserstein metric with mobility 
M 


with $(z) = i|lz|l^_i and T(z) = ^\\DS{z)\\j^, where 
the latter does not affect the optimal evolution. One has 
$ 7 ^ 'I' in the presence of fluctuations, whereas for the 
optimal path = 2<i) = 2'!' holds. 

The Lagrangian for the gradient flow {K = T^), 
Eq. (|^, can be rewritten in the form of (19), 


V = {DS{z),z) - ^\\z\\h^ = {DS{z),z) - $(i), 

with T] = and ||||l 2 as defined in the Appendix. An 
equivalent result is obtained for the Wasserstein gradient 
flow (Wm = K~^), noting that the last term of Eq. ([^, 
with H = and i -I- V • J = 0, can be rewritten as 

(J,H(^)J) = i(J,J)i2 = l{z,z)w^ = $(i). (20) 


The vectorial norm is defined analogously to the scalar 
case, and the second equality in (20) is detailed in the 


Appendix. Similarly derivations for the vectorial case 
considered in (j^ lead to 


(J,Hc(z)J) -f (z„,H„(z)i;„) = ||z«||^-i + l|zc||^-i 

= ||z||^_. =$(i), (21) 


with K ^ = diag ^). For the coupled case con¬ 

sidered in Eq. (|^ , AT is a full matrix and its inverse reads 


f 2H„ -2H„,(V-)-' \ 

-V-i2H,(V-)-V’ 


( 22 ) 


where the inverted divergence (V-)~^ and inverted gradi¬ 
ent are to be interpreted in appropriate spaces. The 
relations KK~^ = K~^K = / immediately follow from 
2MH = 2HM = I. Then, one similarly obtains that 


( 



HL hJ 




(23) 

The variational principles of Eqs. (1^ can therefore 
be written as min^ |jz — K{z) T>S'(z)][^_i. We thus ob¬ 
serve that the diagram of Fig. [^commutes if AT oc trcr*. 



a, 


L2 


ad'Bt,x,crdBt,x 


Wm. 


= 


= (24) 


from which one obtains that dBt^a, = M or equiv¬ 

alently, ad'Bt^x = V • (M^/^dBt^a;). For the diffusion 
equation ([^ with unit diffusion constant, the stochastic 
version given by (13) with a = '/K reads 


p = Ap -h V • 


(25) 


This equation of fluctuating hydrodynamics [41] is known 
as Dean-Kawasaki model [1^44] . 


V. CONCLUSIONS 

We provide two independent derivations of a varia¬ 
tional principle governing dissipative evolution equations 
of the form z = K{z)DS{z). The first is based on 
the maximization of the entropy production within the 
manifold of constraints, extending Onsager’s original ap¬ 
proach, and provides insight into the geometry of the 
gradient flow structure (AT). In particular, the principle 
captures multiple metrics: one which is related to a ther¬ 
modynamic length, and others that may result from the 
constraints in the system, such as conservation of mass 
or energy. The first metric is here taken as the met¬ 
ric and is in principle unknown (an extension to general 
metrics, as in SEA, is yet to be explored), whereas the 
second one is an outcome of the variational statement. 
By means of this procedure, the Wasserstein metric is 
here shown to be equivalent to the constrained metric 
associated to conserved fields. The second approach for 
obtaining the variational statement is based on the large 
deviation principle for the gradient flows augmented by 
a noise term (j{z)\fk d'Gt,x, and is shown to be equiv¬ 
alent to the previously derived principle for K oc crcr*. 
This represents a fluctuation-dissipation relation in in¬ 
finite dimensions and endows the exponent of the large 
deviation principle with the usual interpretation of an 
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entropy (dissipation) shortfall between a given path and 
the optimal one [45]. 


VI. APPENDIX 

We write the weighted LF' norm as = 

f m(x)v(x)w(x) dx, and denote := Lf and (,) := 
(,)i 2 (note that for square integrable functions, (,}l^ is 
equivalent to the duality pairing). Then the weak formu¬ 
lation of the gradient flow for the diffusion equation 
is (z := p and S' = in Eq. 

{p,v)l^ = {DSi{p),v)l2^ 

= -(Vp, = (V • {m{p)Wp),v)L2. 

For the Wasserstein gradient flow, if Zi = W^pi = 
—V • (MVpi) = V • Ji with Vpi = 0 on the Wasser¬ 
stein norm is 

(ii,i2)wM := (MVpi, Vp2)l2 = -(V • MVpi,P2)l2 


The second expression is known as the Hi seminorm 
with weight M, (pi,p 2 ) <>1 — (MVpi, Vp 2 )l 2 . We write 

i-rm 

(see (13 Appendix D] for details) 

(zi,Z2)wm = (MVpi,Vp2)l2 = (M ^Ji,J2)l2 

=: {zi,Z2)h^i . 

M-1 

With this notation, it is straightforward to calculate the 
weak formulation of the diffusion equation as a Wasser¬ 
stein gradient flow (z := p, M 2 = mpl = 7712 ! and S = S 2 
in Eq. Q), 

{P,Z2)w^2 = (Vi:>S2(p),m2Vp2(^2))L2- 
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